IN THE CLAIMS: 

1. (Original) For use in a processor having an at least four-wide instruction issue 
architecture, a mechanism for pipeline processing multiply-accumulate instructions with out-of-order 
completion, comprising: 

a multiply-accumulate unit (MAC) having an initial multiply stage and a subsequent 
accumulate stage; and 

out-of-order completion logic, associated with said MAC, that causes interim results 
produced by said multiply stage to be stored when said accumulate stage is unavailable and allows 
younger instructions to complete before said multiply-accumulate instructions. 

2. (Original) The mechanism as recited in Claim 1 wherein said initial multiply stage 
and said subsequent accumulate stage are single clock cycle stages. 

3. (Original) The mechanism as recited in Claim 1 wherein said out-of-order 
completion logic is contained in a writeback stage of a pipeline in said processor. 

4. (Original) The mechanism as recited in Claim 1 wherein said out-of-order 
completion logic writes back said interim results to at least one register in said MAC before said 
multiply-accumulate instructions arrive at said accumulation stage of said MAC. 

5. (Original) The mechanism as recited in Claim 1 wherein said interim results are 
unavailable to an external program executing in said processor. 
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6. (Original) The mechanism as recited in Claim 1 wherein grouping logic within said 
processor groups said multiply-accumulate instructions based on said mechanism. 

7. (Original) The mechanism as recited in Claim 1 wherein said processor is a digital 
signal processor. 



8. (Original) For use in a processor having an at least four-wide instruction issue 
architecture, a method of pipeline processing multiply-accumulate instructions with out-of-order 
completion, comprising: 

providing a multiply-accumulate unit (MAC) having an initial multiply stage and a 
subsequent accumulate stage; 

causing interim results produced by said multiply stage to be stored when said accumulate 
stage is unavailable; and 

allowing younger instructions to complete before said multiply-accumulate instructions. 

9. (Original) The method as recited in Claim 8 wherein said initial multiply stage and 
said subsequent accumulate stage are single clock cycle stages. 

10. (Original) The method as recited in Claim 8 wherein said causing is carried out in 
a writeback stage of a pipeline in said processor. 

1 1 . (Original) The method as recited in Claim 8 wherein said causing comprises writing 
back said interim results to at least one register in said MAC before said multiply-accumulate 
instructions arrive at said accumulation stage of said MAC. 

12. (Original) The method as recited in Claim 8 wherein said interim results are 
unavailable to an external program executing in said processor. 
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13. (Original) The method as recited in Claim 8 further comprising grouping said 
multiply-accumulate instructions based on said mechanism, 

1 4. (Original) The method as recited in Claim 8 wherein said processor is a digital signal 
processor. 
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15. (Original) A digital signal processor (DSP), comprising: 

a pipeline having stages and capable of processing multiply-accumulate instructions; 

an instruction issue unit containing grouping logic and at least four-wide instruction issue 

logic; 

a multiply-accumulate unit (MAC), coupled to said instruction issue logic, having an initial 
multiply stage and a subsequent accumulate stage; and 

out-of-order completion logic, associated with said pipeline, that causes interim results 
produced by said multiply stage to be stored when said accumulate stage is unavailable and allows 
younger instructions to complete before said multiply-accumulate instructions. 

16. (Original) The DSP as recited in Claim 1 5 wherein said initial multiply stage and said 
subsequent accumulate stage are single clock cycle stages. 

17. (Original) The DSP as recited in Claim 15 wherein said out-of-order completion 
logic is contained in a writeback stage of said pipeline. 

18. (Original) The DSP as recited in Claim 15 wherein said out-of-order completion 
logic writes back said interim results to at least one register in said MAC before said multiply- 
accumulate instructions arrive at said accumulation stage of said MAC. 

19. (Original) The DSP as recited in Claim 15 wherein said interim results are 
unavailable to an external program executing in said DSP. 
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20. (Original) The DSP as recited in Claim 1 5 wherein said grouping logic groups said 
multiply-accumulate instructions based on said mechanism. 
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